e#: Sharper Expertise Detection from Microblogs

نویسندگان

  • Thibault Sellam
  • Martin Hentschel
  • Vasileios Kandylas
  • Omar Alonso
چکیده

Microblogging platforms such as Twitter provide low cost access to an immense reserve of authoritative professionals, opinion leaders and hobbyists for a wide range of topics. Yet, as microposts are short and incredibly diverse, many of these experts are hidden. In this paper, we present e#, a system to retrieve experts automatically for a given set of keywords. Our design targets exhaustivity: e# can detect previously undetectable experts. The core idea is to enhance a state-ofthe-art expert detection algorithm with a graph of expertise domains. Our system produces this graph from hundreds of Gigabytes of Web search query logs and behavioral data, processed in a distributed, parallel fashion. We provide a detailed description of our architecture, including an original SQL-based community detection algorithm. We then benchmark our system with 750 queries, using crowdsourcing. We observe that e# finds many more experts than a state-of-the-art baseline.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Towards Events Detection from Microblog Messages

Microblogs have dramatically changed the mechanism of information propagation. It has been an inevitable issue for governments and enterprises to face the challenges that microblogs bring to public safety management. Since 2010, a lot of emergent events were firstly reported in microblogs, and this trend is even becoming more and more prominent. As microblogs have shown much influence in public...

متن کامل

Finding Bursty Topics from Microblogs

Microblogs such as Twitter reflect the general public’s reactions to major events. Bursty topics from microblogs reveal what events have attracted the most online attention. Although bursty event detection from text streams has been studied before, previous work may not be suitable for microblogs because compared with other text streams such as news articles and scientific publications, microbl...

متن کامل

Real-time Detection and Sorting of News on Microblogging Platforms

Due to the increasing popularity of microblogging platforms (e.g., Twitter), detecting realtime news from microblogs (e.g., tweets) has recently drawn a lot of attention. Most of the previous work on this subject detect news by analyzing propagation patterns of microblogs. This approach has two limitations: (i) many non-news microblogs (e.g. marketing activities) have propagation patterns simil...

متن کامل

Geo-spatial Domain Expertise in Microblogs

In this paper, we present a framework for describing a user’s geo-spatial domain expertise in microblog settings. We investigate a novel way of casting the expertise problem by using points of interest (POI) as a possible categorization of expertise. To this end, we study a large-scale sample of geo-tagged tweets and model users’ location tracks in order to gain insights into their daily activi...

متن کامل

I See a Car Crash: Real-Time Detection of Small Scale Incidents in Microblogs

Microblogs are increasingly gaining attention as an important information source in emergency management. Nevertheless, it is still difficult to reuse this information source during emergency situations, because of the sheer amount of unstructured data. Especially for detecting small scale events like car crashes, there are only small bits of information, thus complicating the detection of rele...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016